Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision
Authors
Abstract
In this paper, we study learning visual classifiers from unstructured text descriptions at part precision with no training images. We propose a learning framework that is able to connect text terms to their relevant parts and suppress connections to non-visual text terms without any part-text annotations. For instance, this learning process enables terms like “beak” to be sparsely linked to the visual representation of parts like the head, while reducing the effect of non-visual terms like “migrate” on classifier prediction. Images are encoded by a part-based CNN that detects bird parts and learns part-specific representations. Part-based visual classifiers are predicted from text descriptions of unseen visual classifiers to facilitate classification without training images (also known as zero-shot recognition). We performed our experiments on the CUBirds 2011 dataset and improved the state-of-the-art text-based zero-shot recognition results from 34.7% to 43.6%. We also created large-scale benchmarks on North American Bird Images augmented with text descriptions, where we also show that our approach outperforms existing methods. Our code, data, and models are publicly available [1].
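The idea of predicting part-based classifiers from text can be illustrated with a toy sketch. This is not the authors' implementation: the linear map `W`, the function names, the three-term vocabulary, the two parts, and all numbers below are assumptions chosen for illustration. Each row of `W` corresponds to a text term and is split into one block per part, so a visual term such as "beak" can carry weight only in the head block, while a non-visual term such as "migrate" can have an all-zero row and no influence on the predicted classifier.

```python
import numpy as np

def predict_classifiers(text_feats, W):
    """Map class text vectors (n_classes, n_terms) to part-based
    classifiers of shape (n_classes, n_parts * part_dim)."""
    return text_feats @ W

def zero_shot_classify(image_feats, text_feats, W):
    """Score each image against every text-predicted classifier and
    return the index of the best-scoring class per image."""
    classifiers = predict_classifiers(text_feats, W)
    scores = image_feats @ classifiers.T
    return scores.argmax(axis=1)

def term_part_strength(W, n_parts, part_dim):
    """L2 norm of each (term, part) block of W -> (n_terms, n_parts).
    A zero entry means the term is not linked to that part."""
    n_terms = W.shape[0]
    return np.linalg.norm(W.reshape(n_terms, n_parts, part_dim), axis=2)

# Hypothetical toy setup: terms = [beak, wing, migrate], parts = [head, wing].
n_terms, n_parts, part_dim = 3, 2, 2
W = np.zeros((n_terms, n_parts * part_dim))
W[0, 0:2] = [1.0, 0.5]   # "beak" is linked only to the head block
W[1, 2:4] = [0.8, 0.2]   # "wing" is linked only to the wing block
# The "migrate" row stays zero: a non-visual term with no part links.

# Text descriptions of two unseen classes (made-up term weights).
text_feats = np.array([[1.0, 0.0, 1.0],    # class 0 mentions beak, migrate
                       [0.0, 1.0, 1.0]])   # class 1 mentions wing, migrate

# Part-based image features: [head block | wing block] (made-up values).
image_feats = np.array([[1.0, 1.0, 0.0, 0.0],   # strong head response
                        [0.0, 0.0, 1.0, 1.0]])  # strong wing response

predictions = zero_shot_classify(image_feats, text_feats, W)
strength = term_part_strength(W, n_parts, part_dim)
```

In this toy setting, "migrate" appears in both class descriptions yet leaves the predicted classifiers unchanged, because its row in `W` is zero. How such a sparse map would be learned without part-text annotations is the subject of the paper and is not shown here.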
Similar resources
Imagine it for me: Generative Adversarial Approach for Zero-Shot Learning from Noisy Texts
Most existing zero-shot learning methods consider the problem as a visual semantic embedding one. Given the demonstrated capability of Generative Adversarial Networks (GANs) to generate images, we instead leverage GANs to imagine unseen categories from text descriptions and hence recognize novel classes with no examples being seen. Specifically, we propose a simple yet effective generative model...
Ordinal Zero-Shot Learning
Zero-shot learning predicts a new class even if no training data is available for that class. The solution to conventional zero-shot learning usually depends on side information such as attributes or text corpora. But such side information is not easy to obtain or use. Fortunately, in many classification tasks, the class labels are ordered, and therefore closely related to each other. This paper d...
Supplementary Material: An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild
– Section 1: Hyper-parameter tuning strategies (Section 4.2 of the main text). – Section 2: Novelty detection approaches: details and additional results (Section 4.3 and 5.2 of the main text). – Section 3: Comparison between zero-shot learning approaches: additional ZSL algorithm, dataset, and results (Section 5 of the main text). – Section 4: Analysis on (generalized) zero-shot learning: detai...
The Comparative Effect of Antonym in-Text Glosses and Description in-Text Glosses on EFL Learners' Reading Comprehensio
The present study was carried out to investigate the comparative effect of antonym in-text glosses and description in-text glosses on a group of Iranian EFL learners' reading comprehension. To fulfill the purpose of this study, 60 female intermediate students between 18 and 19 years old were selected among a total number of 90 through their performance on a piloted PET. These 60 participants we...
A Comparison of Precision Teaching and Direct Instruction: Two Methods Based on the Behaviorist Approach in Learning Disorder
Background: The purpose of the present study was to compare the two methods derived from the behavioral approach, precision teaching and direct instruction with an emphasis on their application in the area of learning disorder. To do this, using the systematic review method by searching the following words (learning disability, learning disorder, precision teaching, direct instruction) in scien...
Journal title:
Volume, issue
Pages -
Publication date: 2017